Qi Gu

MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft

May 29, 2026
Via arXiv

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

May 27, 2026
Via arXiv

GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

May 27, 2026
Via arXiv

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

May 26, 2026
Via arXiv

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

May 26, 2026
Via arXiv

When to Stop Reusing: Dynamic Gradient Gating for Sample-Efficient RLVR

May 19, 2026
Via arXiv

Self-Distilled Agentic Reinforcement Learning

May 14, 2026
Via arXiv

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

Apr 20, 2026
Via arXiv

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Apr 02, 2026
Via arXiv

$V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts

Mar 11, 2026
Via arXiv